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POLYPEPTIDES 

The present invention relates to a polypeptide 
having a sequence corresponding to an antigenic epitope on 
human polymorphic epithelial mucin (PEM), to its production 
and use in diagnosis and therapeutic treatment of cancer. 

In WO88/05054 there is described the discovery of 
a tandem repeat sequence of amino acid residues appearing 
in the human PEM protein expressed by normal or transformed 
epithelial cells . At least in the case of breast cancer, 
malignant cells express an under-glycosylated- form of PEM 
which is detected by the SM3 monoclonal antibody. That 
antibody does not react with fully processed PEM as 
produced by normal cells so that the antibody has uses in 
the diagnosis and therapy of breast cancer . 

Further investigations have surprisingly revealed 
that the epitope recognised by SM3 is a small, continuous 
amino acid sequence occurring within the tandem repeat 
sequence previously described . Identification of the 
epitope permits synthetic preparation of peptides mimicking 
the epitope which will be useful in generating antibodies 
with improved specificity for tumour as opposed to normal 
PEM . 
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The present invention therefore provides a 

(a) polypeptides of formula (I) 

(X} K { (A) a PDTRP(A) n } y {X} 2 (I) 

wherein x and z are the same or different and each is 0 or 

an integer of one or more; 

y is an integer of one or more 

each X, where present, is an independently selected amino 
acid residue; 

m and n are the same or different and each is 0 or one and 
A,P,D,T and R are the internationally recognised one-letter 
symbols for the residues of the amino "acids, alanine , 
proline , aspartic acid, threonine and arginine 
respectively; 

(b) polypeptides containing a sequence immunologically 
analogous to { (A) m PDTRP(A) n } ; 

and ( c) polypeptides as defined in (a) or (b) above bearing 

one or more saccharide moieties ; 

and pharmaceutically accept salts thereof ; 

other than a polypeptide specifically disclosed in 

WO88/05054 and other than beta-glucuronidase , which also 

contains the sequence PDTHP . 

Polypeptides as defined under (a), (b) and ( c ) 
above , subject to the same exclusions, will hereafter be 
referred to as "PDTRP epitopes " . 

The simplest polypeptides according to the 
present invention are the pentapeptide PDTRP, immunological 
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analogues thereof and glycosylated derivatives of the 
pentapeptide and its analogues . Hexapeptides according to 
the invention include APDTRP , PDTRPA, immunological 
analogues thereof and glycosylated derivatives of the 
hexapeptides and analogues . Heptapeptides according to- the 
invention include APDTRPA and immunological analogues 
thereof and glycosylated derivatives of the heptapeptide 
and analogues. Larger polypeptides according to the 
invention include additional amino acid residues at the 
amino and/or carboxy terminal of the penta- , hexa- and 
hepta-peptides , analogues and derivatives • Such 
polypeptides may comprise , for instance , up to about 200, 
for instance up to about 50 , particularly up to 30 , 
preferably up to 20 and more preferably from 10 to 15 amino 
acid residues . Maximum and preferred values of x and z are 
determined accordingly- The polypeptides must include at 
least one five amino acid residue sequence PDTRP or an 
immunological analogue thereof and may include several 
repeats of this sequence , for instance 2 or 3 repeats and 
perhaps up to 100 or more such repeats . Each such repeat 
independently may include one or both of the amino and/or 
carboxy terminal alanine residues , or immunological 
equivalents thereof and may be extended at either or both 
the N- and C-termini with other amino acids* Typically 
each repeat will comprise 19 or less amino acid residues . 
Where the polypeptide is glycosylated, any repeats of the 
PDTRP sequence or analogues thereof are independently 
glycosylated , for instance by attachment of a linkage sugar 
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such as N-acetyl galactosamine on the threonine residue of 
PDTRP and/or on threonine or serine residues outside the 
epitope PDTRP sequence or which are included within the 
epitope sequence of an immunological equivalent of PDTRP 
( for instance, wherein the threonine residue has been 
replaced by a serine residue ) . 

Immunological analogues according to the present 
inventions are polypeptides having a sequence corresponding 
to that of formula (I) except that the sequence 
(A) PDTRP (A) is modified at at least one amino acid 

is n 

residue by substitution and/or is modified by insertion of 
one or more additional amino acid residues provided that 
the modified sequence is capable of acting as an epitope 
for the generation of antibodies and provided that the 
antibodies so generated which recognise the modified 
epitope also recognise at least one of 

(a) a polypeptide of formula (I) 

(b) a polypeptide comprising the tandem repeat 
sequence as defined in WO88/05054 and 

(c) human PEM core protein 

but, in each case have reduced or no reaction and 
preferably substantially do not react with fully processed 
human PEM glycoprotein as produced by the normal lactating 
human mamma ry gland. 

Immunological analogues , also known as mimotopes , 
may be designed for instance as described in "Peptide 
Protein and Gene Technology Advances" , Issue No . 2, 
Cambridge Research Biochemicals UK ( copy attached ) and in 
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"The delineation of Peptides able to mimic assembled 
Epitopes" , H . M ♦ Geysen et al in "Synthetic Peptides as 
Antigens" (Ciba Foundation Symposium 119), R * Porter and 
j. whelan (Eds) , Pitman London 1986 pp 130-149 . 

Preferably the polypeptides of the present 
invention are synthetic . 

PDTRP epitopes according to the invention may be 
produced de novo by synthetic methods or by expression of 
the appropriate DNA fragments described below by 
recombinant DNA techniques and expressed with or without 
glycosylation in human or non-human cells . 

A peptide may be built up from single amino acids 
and/or preformed peptides of two or more amino acids in the 
order of the sequence of the desired peptide and optionally 
the resultant peptide may be converted into a 
pharmaceutically acceptable salt if desired. 

Solid-phase or solution methods may be employed . 
In solid-phase synthesis, the amino acid sequence of the 
desired peptide is built up sequentially from the 
C-terminal amino acid which is bound to an insoluble resin . 
When the desired peptide has been produced , it is cleaved 
from the resin . When solution-phase synthesis is employed, 
the desired peptide may again be built up from the 
C-terminal amino acid but the carboxy group of .this acid 
remains blocked throughout by a suitable protecting group , 
which is only removed at the end of the synthesis . 
Whichever technique is employed, each amino acid added to 
the reaction system typically has a protected amino group 



WO 90/05142 PCT/GB89/01340 

and an activated carboxy group. An amino group may be 
protected by the f luoren-9-ylmethoxycarbonyl ( Fiaoc ) or 
t-butoxycarbonyl (Boc ) group. A carboxy group may be 
activated as a pentaf luorophenyl or l-oxo-2-hydroxy- 
dihydrobenzotriazine ester . Functional side-chain groups 
such as the side chain amino group of lysine and the side 
chain hydroxy group of threonine may be protected too. 
Each condensation step may be effected in the presence of 
dicyclohexylcarbodiimide or 1-hydroxybenzotriazole . After 
each step in the synthesis , the amirio-pro tec ting group is 
removed. Side-chain functional group protecting groups are 
generally removed at the end of the synthesis . 

To convert the resultant peptide into a 
pharmaceutical^ acceptable salt, it may be treated with an 
organic or inorganic acid to form an acid addition salt . 
Suitable acids include acetic , succinic and hydrochloric 
acid . Alternatively, the peptide may be converted into a 
carboxylic acid salt such as the ammonium salt or an alkali 
metal salt such as the sodium or potassium salt by use of 
an appropriate base . 

The PDTEP epitopes of the invention may also be 
prepared by recombinant DNA methodologies ; an expression 
vector is prepared which incorporates a DNA sequence 
encoding the PDTRP epitope and which is capable of 
expressing the PDTRP epitope when provided in a suitable 
host. The DNA sequence is located between translation 
start and stop signals in the vector . Appropriate 
transcriptional control elements are also provided, in 
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particular a promoter for the DMA sequence and a 
transcriptional termination site • The DNA sequence is 
provided in the correct reading frame so as to enable 
expression of the peptide to occur in a host compatible 
with the vector . 

Any appropriate host-vector system may be 
employed. For instance , the vector may be a plasmid or 
viral vector and the host may be any prokaryotic or 
eukaryotic cells such as bacterial or yeast cell or cells 
of a mammalian cell line . 

Alternatively the PDTRP epitopes may be obtained 
by stripping carbohydrate from native human mucin 
glycoprotein (which itself may be produced by isolation 
from samples of human tissue or body fluids or by 
expression and full processing in a human cell line ) 
(Burchell et al . , Cancer Research, £7, 5467-5482, (1987), 
Gendler et al . , P.N.A.S. , 84 6060-6064, (1987 ) ) , and 
digesting the core protein . 

PDTRP epitopes linked to carrier proteins such as 
keyhole limpet haemocyanin, albumen or thyroglobulin are 
also within the invention. Linkage may be by known 
chemical methods or by expression of the PDTRP epitope and 
carrier protein as a fusion protein by genetically 
transformed cells as herein described. 

In another aspect the invention provides a 
nucleic acid fragment encoding a PDTRP epitope . 

Fragments according to the invention may be RNA 
or DNA and DNA fragments may be single or double stranded . 
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Pref erred such nucleic acid fragments are DNA fragments 
containing a continuous coding sequence of formula (II) 
which encodes a FDTRF epitope of formula (I) as 
hereinbefore defined, 

[CGX ] 

{XXX} { { GCX ) CCX GAPy ACX [ or ] CCX( GCX) } v {XXX} z (II) 
. ~ [AGPu] y 

wherein m, n, x, y and z are all as defined in relation to 
formula ( I ) ; 

X is any deoxyr ibonucleotide 

XXX is a codon encoding an amino acid residue; 
Py is C or T; 
Pu is A or G; 

and A, C, G and T are the internationally recognised 

one-letter codes for. the four deoxyr ibonucleotide bases . 

Preferred nucleic acid fragments of the invention 
comprise the sequence ; 

(GCC) CCG GAC ACC AGG CCG (GCC) 

m n 

An alternative sequence encoding FDTRP epitopes 
is to be found in the ^-glucuronidase gene of coli and 
contains the . coding sequence 

CCG GAT ACC CGT CCG 
for PDTRP . Fragments containing this sequence and 
optionally further comprising flanking sequences which may 
or may not be related to the g-glucuronidase sequence 
( other than the complete ^-glucuronidase gene and fragments 
specifically disclosed in Proc. Natl . Acad. Sci. USA . 
8447-8451 (1986 ) ) form further aspects of the present 
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invention. 

The invention further provides nucleic acid 
fragments encoding PDTRP epitopes which are immunologically 
analogous to peptides of formula (I) as hereinbefore 
defined . 

These fragments will be useful both for 
generating PDTRP epitopes as described below and as probes 
for detecting DNA encoding PDTRP epitopes. For such 
purposes it may be convenient to attach detectable labels 
( such as radioisotopes , fluorescent or enzyme labels ) to 
the fragment or to bind the fragment to a solid support . 
All of these may be achieved by conventional methods 
(Matthews et al, Anal . Biochem . 169 , 1-25 (1988)}* 

The nucleic acid fragments of the invention may 
be produced de novo by conventional nucleic acid- synthesis 
techniques or obtained from human epithelial cells by 
conventional methods ( Huynh et al , "DNA Cloning: A 
Practical Approach" ed Glover, D.M. ( IRL, Oxford) , 1985 , 
Vol 1 pp49-48 ) . 

In further aspects the present invention provides 
cloning and expression vectors comprising nucleic acid 
fragments as hereinbefore defined (the expression vectors 
having the fragment in open reading frame relation to 
initiation and termination signals and being associated 
with appropriate regulatory and promoter sequences so as to 
form an expressible sequence ) and host cells transformed 
with such cloning or expression vectors . 

The nucleic acid fragments of the present 
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invention may also be used in active immunisation 
techniques. For use in such methods the fragment, which 
codes for a PDTRP epitope may be extended at either or both 
the 5 ' and 3 ' ends with further coding or non-coding 
nucleic acid sequences including regulatory and promoter 
sequences, marker sequences and splicing or ligating sites . 
In addition to encoding the PDTRP epitope , coding sequences 
may code for other portions of the mucin protein chain or 
for other polypeptide chains. The fragment according to 
the invention, together with any necessary or desirable 
flanking sequences is inserted, in an appropriate open 
reading frame register , into a suitable vector such as a 
plasmid or a viral genome ( for instance vaccina vi rus 
genome) and is then expressed as a polypeptide product by 
conventional techniques - In one aspect the polypeptide 
product including the PDTRP epitope may be produced by 
culturing appropriate cells transformed with a vector, 
harvested and used as an immunogen to induce active 
immunity against the mucin core protein (Tartaglia et al . , 
Tibtech , 6 , 43 { 1988 ) } . In another aspect the vector, 
particularly in the form of a virus , may be directly 
inoculated into a human or animal to be immunised . The 
vector then directs expression of the PDTRP epitope in vivo 
and this in turn serves as an immunogen to induce active 
immunity against the mucin core protein . The DNA 
sequence may, in one embodiment, be inserted into another 
gene and expressed as a fusion protein. For example, a 
nucleotide sequence encoding a PDTRP epitope flanked 
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by other amino acid sequences (which are the same as or 
different from the mucin tandem repeat sequence ) is 
expressed in a genetically engineered poliovirus by 
inserting the oligonucleotide into the gene encoding capsid 
protein of the Sabin type 1 vaccine strain of poliovirus , 
as has been done for other protein epitopes ( Evans et al • , 
1989/ Nature 339 , 385-388 ) . Such an engineered virus may 
readily be tested for its ability to induce tumour 
regression in humans or to delay relapse , and ultimately to 
prevent tumour development . 

The invention therefore provides probes , vectors 
and transformed cells for use in the methods of treatment 
of the human or animal body by surgery or therapy and in 
diagnostic methods practised on the human or animal body 
and for use in the preparation of medicaments for use in 
such methods . The invention also provides methods for 
treatment of the human or animal body by surgery or the rapy 
and diagnostic methods practised in vivo as well as ex vivo 
and in vitro which comprise administering such fragments , 
probes , vectors or transformed cells in effective non-toxic 
amount to a human or other mammal in need thereof as well 
as diagnostic methods practised ex vivo or in vitro 
comprising contacting a body fluid or tissue sample with 
the fragments or probes . 

The invention further provides a process for 
producing the PDTRP epitopes comprising cul turing such host 
cells transformed with an expression vector and recovering 
expressed polypeptides . 
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In a further aspect the invention provides 
' antibodies other than SM3 and HMFG-2 capable of recognising 
a PDTRP epitope as hereinbefore defined and preferably 
having reduced or no reaction with fully processed human 
PEM glycoprotein as produced by the normal lactating human 
mammary gland* 

As used herein the term "antibody" is intended to 
include polyclonal and monoclonal antibodies and fragments 
of antibodies bearing antigen binding sites such as the 
F ( ab ' ) fragments as well as such antibodies or fragments 
thereof which have been modified chemically or genetically 
in order to vary the amino acid residue sequence of one or 
more polypeptide chains , to change the species specific 
and/or i so type specific regions and/or to combine 
polypeptide chains from different sources. Such antibodies 
may be obtained by conventional methods (Williams , Tibtech, 
6_ r 36, (1988)) and are useful in diagnostic and therapeutic 
applications , such as passive immunisation. 

Antibodies according to the present invention 
react with PDTRP epitopes as hereinbefore defined; they 
also react with HPEM core protein, especially as expressed 
by colon, lung, ovary and particularly breast carcinomas, 
but have reduced or no reaction with corresponding fully 
processed HPEM. In a particular aspect the antibodies 
react with HPEM core protein but not with fully processed 
HPEM glycoprotein as produced by the normal lactating human 
mammary gland. Antibodies according to the present 
invention preferably have no significant reaction with the 
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mucin glycoproteins produced by mammary epithelial tissues 
lactating or pregnant individuals but react with the mucin 
proteins expressed by mammary epithelial adenocarcinoma 
cells • These antibodies show a much reduced reaction with 
benign breast tumours and are therefore useful in diagnosis 
and localisation of breast cancer as well as in therapeutic 
methods ♦ 

The antibodies may be used for other purposes 
including screening cell cultures for the polypeptide 
expression product of the human mammary epithelial mucin 
gene , or fragments thereof, particularly the nascent 
expression product* In this case the antibodies may 
conveniently be polyclonal or monoclonal antibodies . 

The invention further provides antibodies linked 
by known techniques to therapeutically or diagnosti cally 
effective moieties or to linking moieties . The linking 
moieties may be known linking agents for binding 
therapeutically or diagnostically effective moieties to the 
antibodies , for instance the linking moiety may be a 
specific binding partner for a specific binding agent, the 
therapeutically or diagnosically effective moiety being 
bound to the specific binding agent. Following separate 
administration of the antibody : linking moiety ( specific 
binding partner ) adduct and of the therapeutically or 
diagnostically effective moiety: specific binding agent 
adduct , binding of the specific binding agent to the 
specific binding partner results in the therapeutically or 
diagnostically effective moiety being bound to the 



WO 90/05142 PCT/GB89/01340 

-14- 

antibody. A preferred example of a specific binding agent/ 
specific binding partner pair is biotin and avidin or 
streptavidin in which biotin may be the binding agent or 
the binding partner. 

For therapeutic use of the antibodies the 
therapeutically effective moieties are lethal agents to be 
delivered to cancerous breast or other tissue in order to 
incapacitate or kill transformed cells . Lethal agents 
include toxins/ radioisotopes and "direct killing agents" 
such as components of complement as well as cytotoxic or 
other drugs or enzymes which activate cytotoxic drugs at 
the tumour site . 

For diagnostic use the diagnostically effective 
moieties may be moieties such as solid supports and 
detectable labels such as enzyme labels , chromophores , 
fluorophares and radioisotopes and other directly or 
indirectly detectable labels . Preferably monoclonal 
antibodies are used in diagnosis . 

The invention further provides a diagnostic test 
process or assay method comprising contacting a sample 
suspected to contain abnormal human mucin glycoproteins 
with an antibody as defined above . Such methods include 
tumour localisation involving administration to the patient 
of the antibody bearing detectable label or administration 
of an antibody and f separately, simultaneously .or 
sequentially in either order, administering a labelling 
entity capable of selectively binding the antibody or 
f ragment the reof . 
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Particular uses of the antibodies include 
diagnostic assays for detecting and/or assessing the 
severity of breast, colon, ovary and/or lung cancers • 

Diagnostic test kits are provided for use in 
diagnostic process and assays and comprise antibody and, 
optionally, suitable labels and other reagents such as 
buffers, reagents for labelling the antibody and/or for 
detecting labelled antibody and, especially for use in 
competitive assays , standard sera . 

The invention further provides PDTRP epitopes and 
antibodies as hereinbefore defined for use in methods of 
surgery, therapy or diagnosis practised on the human or 
animal body or for use in the production of medicaments for 
use in such methods . The invention also provides a method 
of treatment or diagnosis which comprises administering an 
effective non-toxic amount of a polypeptide or antibody as 
hereinbefore described to a human or animal in need 
thereof . Dosage rates will depend on the age , weight, 
size , sex and general health of the patient and will be 
sufficient to achieve appropriate levels of circulating 
antibodies without undesirable or intolerable side effects . 

Administration may be by the oral or parenteral 
route , for instance by sub-cubtaneous , intravenous or 
intramuscular injection. Typically a PDTRP epitope is 
administered to a human adult in an amount up to 1 g per 
dose , preferably 1 to 200 mg per dose , by either the oral 
or the parenteral route . 

The invention therefore also provides 
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pharmaceutical compositions comprising a PDTRP epitope of 
the invention and a suitable carrier, diluent and/or 
adjuvant, which will r of course , be determined by the route 
of administration and selected for compatibility with the 
PDTRP epitope . Suitable carriers f diluents and adjuvants 
include Freunds incomplete adjuvant { IFA) , aluminium 
hydroxide, saponin, DEAE-dextran, muramyl dipeptide , 
mineral oils, neutral oils such as miglyol, vegetable oils 
such as arachis oil , "I scorns 11 , liposomes , Pluronic (trade 
mark), polyols or the Ribi adjuvant system (GB-A-2189141 ) . 
For injection the compositions may be in the form of 
aqueous sterile r non-pyrogenic solutions optionally 
containing buffers , antioxidants , biocides such as 
antibacterial s and antifungals and agents to adjust the 
tonicity. Alternatively the compositions may be provided 
as dry powders , optionally containing excipients such as 
those mentioned above , for reconsti tution by addition of 
water for injection . 

Antibodies , whether polyclonal or monoclonal , 
against PDTRP epitope may be used in therapeutic methods 
for treating humans and animals , for instance in the form 
of -vaccines for passive immunisation . Preferably they are 
formulated as pharmaceutical compositions such as 
are described above . 

Especially in therapeutic applications it may be 
appropriate to modify the antibody by coupling the Fab, or 
complementarity-determining region thereof, to the Fc, or 
whole framework region, of antibodies derived from the 
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species to be treated (e.g. such that the Fab region of 
mouse monoclonal antibodies may be administered with a 
human Fc region to reduce immune response by a human 
patient) or in order to vary the isotype of the" 
antibody (see EP-A-0 239 400) • 

The present invention also provides a process for 
producing polyclonal antibodies against a PDTRP epitope 
comprising inoculating a host animal with transfected cells 
capable of expressing the PDTRP epitope and recovering 
antibodies from the serum or other body fluid of the 
animal . 

The present invention further provides a process 
for producing a cell capable of secreting antibodies 
against a PDTRP epitope comprising inoculating a host 
animal with PDTRP epitope or with transfected cells capable 
of expressing the PDTRP epitope and removing 

antibody-secreting cells from the animal . The cells may be 
capable of secreting antibodies either in vitro , for 
instance under appropriate cell culture conditions and/or 
in vivo , for instance by growth in ascites . 

As used herein the term " animal" refers to any 
animal capable of producing antibodies for instance mammals 
such as laboratory rodents, and birds and includes humans • 

Transfected cells are cells containing a DNA 
expression vector containing an expressible sequence 
encoding the PDTRP epitope . The coding sequence may be 
natural or synthetic and the encoded polypeptide may be 
expressed directly or as a fusion protein. 
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For instance , DNA coding for the PDTRP epitope is 
identified and obtained from a natural source or produced 
by partial or total synthesis. Techniques for obtaining 
natural or artificial DNA encoding a particular polypeptide 
are well known in the field of genetic engineering and are 
described in Maniatis , T» , Fritsch, E and Sambrook J . 
(1982) Molecular Cloning, a Laboratory Manual , published by 
Cold Spring Harbor Laboratory Press NY. The vector is 
constructed by conventional methods to ensure expression in 
the cells to be transf ected and may be provided with marker 
sequences enabling identification of the DNA and 
transf ectant cells, (Maniatis loc . cit . } • Selective 
markers which are particularly suitable in the present 
invention include drug resistance markers such as the 
thymidine kinase (tK) gene for tK~ cells . ( Szybalska , E 
and Szybalski W. ( 1962 ) Proc. Natl . Acad Sci . , USA, 42 , 
2026) . 

Suitably, promoter sequences may be inserted in 
the vector to increase expression of the PDTRP epitope in 
the transfectant cells (Subramani , S. , Mullingan, R.C. and 
Berg, P. (1981) Hoi. Cell. Biol. 1, 854-864). 

The cells to be transf ected, i.e. the cells into 
which the vector is to be inserted, (hereafter "donor 
cells" ) are obtained from donor animals , such as rodents , 
especially rats and mice t and primates , for instance 
monkeys . The donor cells must be capable, once 
transf ected, of expressing the PDTRP epitope , optionally 
with post translational processing of the PDTRP epitope . 
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Preferably the donor cells and the optionally modified DNA 
are chosen such that the PDTRP epitope appears 
extracellularly or at the cell surface although it is also 
possible to apply the present invention to polypeptides 
which are only expressed intracellularly . 

It is preferred, especially for intracellularly 
expressed polypeptides , that the donor cells are of a type 
not normally found in the peritoneum or circulating in the 
bloodstream of the host animals ♦ 

Preferably the cells to be transfected are 
derived from an animal (hereafter the "donor animal" ) 
syngeneic with the animal ( hereafter the "host animal " ) 
which is inoculated with the transf ected cells . Without 
wishing to be bound by a particular theory, it is believed 
that by using cells from a syngeneic donor animal , the 
differences between the transfected cells and the host 
animal ' s own cells that can be detected and acted upon by 
the host animal ' s immune system are minimised . Thus any 
immune response is more likely to be directed against the 
foreign polypeptide produced by the transfected cells by 
expression of the DNA sequence . 

To maximise the di recting effect on the immune 
system of the host animal it is desi rable that the host and 
donor animals are substantially genetically identical and 
usually they will be from a well established inbred strain 
such as are well known in the art . Use of monozygotic 
donor and host animals whether or not from an inbred 
strain , or even cells from the host animal itself , is 
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- contemplated. 

Transf ection of donor cells is achieved by 
conventional methods such as using calcium phosphate 
(Austin, P., Trowsdale , J. , Rudd, C. , Bodmer , W. , Feldmann, 
M. and Lamb, J. (1985) Nature , 313 , 1-4 ) or by the 
electroporation technique (Neumann, E . , Schaefer- Ridder , 
M . , Wang, Y . and Hof Schneider , P. H." (1982) EMBO J, 1, 
841-845) or using retroviral vectors (Biotechniques , £, 
608-614 (1988) )• 

After transf ection it may be convenient to screen 
the cells, or clones or sub-clones thereof, to select those 
capable of expressing the PDTRP epitope. With cell 
surface-expressed gene products the use of fluorescent 
antibodies and a fluorescence activated cell sorter * 
is particularly convenient (Austin loc . cit . ) but other 
techniques are available for screening and selection. 

Transf ectant cells , optionally after selection, 
may be treated to enhance the expression of the gene 
product . Various known enhancement methods can be used 
including treatment with sodium butyrate (Gorman, CM. and 
Howard, B.H. ( 1983 ) Nucleic Acids Research, 11, 7631-7648 ) 
or interferon (Balkwill, F.R. , Stevens , M.H. , Griffin, 
D.B * , Thomas, J. A. and Bodmer , J.G. (1987 ) Eur. J. Cancer 
Clin. Oncol ., 23 , 101-106 ) . The enhancement method will be 
selected as appropriate to the type of donor cells . 

The host animal or animals are inoculated with 
PDTRP epitopes or transf ected cells , preferably by 
intraperitoneal or intravenous injection of transf ected 
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cells in a suitable medium. Preferably the inoculation is 
repeated at intervals and the animals' antibody titre is 
monitored to ascertain that the desired immune response 
occurs . The route chosen for administration and the 
inoculation regime are preferably selected in order to 
favour production of the desired class of antibodies , eg 
production of cells secreting IgM antibodies is favoured by 
a short interval between inoculation and recovery of 
antibody-secreting cells, whereas production of cells 
secreting igG antibodies is favoured by a long interval 
between repeated inoculations and eventual recovery of the 
antibody-secreting cells . 

On completion of the inoculation regime antibody 
-secreting cells eg, spleen cells , are taken from the 
- inoculated host animal ( s ) and maintained in an appropriate 
medium. Conventional methods may be used for obtaining and 
maintaining the host cells . Antibody producing cells may 
also be obtained from the body fluids or tissues of host 
animals having carcinoma, particularly lymphocytes from 
excised lymph nodes of human carcinoma , especially breast 
or ovarian cancer, patients . ( JN Immunol . Methods , 105 , 
263-273 (1987)). 

In another aspect the invention provides a 
process for producing a cell capable of secreting 
antibodies against a PDTRP epitope comprising 
removing antibody-secreting cells from a host animal having 
carcinoma or inoculated with a PDTRP epitope or with 
transf ected cells capable of expressing the PDTRP epitope . 



WO 90/05142 _ PCT/GB89/01340 

-22~~ 

In a further aspect the invention provides a ceil 
capable of secreting antibodies against a PDTRP epitope 
which cell has been removed from a host animal having 
carcinoma or inoculated with a PDTRP epitope or with 
transf ected cells capable of expressing the PDTRP epitope . 

It may be convenient at this stage to screen the 
population of antibody-secreting cells obtained from the 
host animal in order to select those secreting antibodies 
against the PDTRP epitope* 

The cells are useful particularly for 
immortalisation in order to form an immortal cell line 
capable of secreting antibodies ( hereafter monoclonal 
antibodies or MAb's) against the PDTRP epitope . 
Immortalisation may be by any of the known methods but 
fusion with an immortal cell to form a hybridoma is perhaps 
the . most conve-nient method . 

Accordingly, the invention also provides a 
process for producing an immortal cell capable of secreting 
antibody against a PDTRP epitope comprising immortalising 
an antibody-secreting animal cell , the antibody-secreting 
animal cell having been taken from a host animal inoculated 
with PDTRP epitope or transf ected cells capable of 
expressing the PDTRP epitope. 

Immortal cells (which cells are capable of 
repeated cell division under appropriate culture 
conditions ) suitable for fusion are well known • The 
immortal cell may be of the same inbred strain as the 
antibody-secreting cells and usually they will be of the 
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same species as the antibody-secreting cells from the host 
animal but this is not essential . Particularly convenient 
immortal cells are myeloma cells. Fusion is achieved by 
the method of Kohler and Milstein, Nature, 256 , 495-497 
( 1975 ) and by variations thereon as described in the 
literature . 

After fusion it may be convenient to screen the 
resulting population of hybrid cells to select those 
secreting antibodies against the FDTRF epitope . 

The present invention also provides an 
immortalised cell , preferably a hybridoma , capable of 
secreting antibodies against a FDTRP epitope , which cell 
comprises an immortalised, antibody-producing cell capable 
of secreting antibodies against the PDTRP epitope taken 
from a host animal inoculated with PDTRP epitope or 
transf ected cells capable of expressing the PDTRP epitope , 
or is a. descendant of such a cell . 

Immortal cells (e.g. hybridoma cells ) according 
to the present invention may be used for producing MAb' s 
against the PDTRP epitope . 

Accordingly, the present invention provides a 
process for producing monoclonal antibodies against a PDTRP 
epitope comprising cul turing an immortalised cell as 
hereinbefore defined capable of secreting antibodies 
against a PDTRP epitope , or a descendant of such a cell , 
and recovering the monoclonal antibodies . 

The invention further provides monoclonal 
antibodies against a PDTRP epitope which antibodies are 
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obtained from growth in tissue culture or ascites of an 
immortal hybrid cell as hereinbefore defined. 

Antibodies produced by the immortalised cells as 
described above may be screened by conventional techniques 
to identify those cells producing MAb's against a PDTRP 
epitope . Such cells may be multiplied and cultured to 
produce MAb's according to the invention in large 
quantities ♦ Suitable culturing techniques and ascites 
growth conditions are well known and are chosen as 
appropriate to the particular immortalised cell line 
involved and the MAb' s to be produced. 

The present invention further provides a fusion. 
- protein comprising a PDTRP epitope as hereinbefore defined 
and a carrier polypeptide or protein bound to the 
C-terminal or the N-terminal of the PDTRP epitope . 

The carrier polypeptide or protein may be any 
suitable polypeptide or protein. Convenient carrier 
proteins are thy roglobulin and bovine serum albumin. 

The fusion proteins of the invention may be 
produced by conventional techniques of polypeptide 
synthesis and protein chemistry from individual amino acid 
precursors or by coupling smaller polypeptides . 
Alternatively the fusion proteins may be produced by 
recombinant DNA techniques involving expression of a gene 
encoding the fusion protein. These me thods are all well 
known to those skilled in the art. 

The invention further provides fusion proteins as 
hereinbefore defined for use in a method of treatment of 
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the human or animal body by surgery or therapy or a 
diagnostic method practised on the human or animal body; 
the use of fusion proteins in the manufacture of a 
medicament for use in the therapeutic, surgical or 
diagnostic treatment of the human or animal body; a 
pharmaceutical composition comprising the fusion protein 
and' a pharmaceutical^ acceptable diluent or carrier 
therefor and, optionally, conventional accessory 
ingredients, and a method for treating the human or animal 
body comprising administering an effective , non-toxic 
amount of a fusion protein to a human or animal in need 
thereof . 

The fusion proteins are particularly useful in 
the treatment or prophylaxis of cancers such as breast 
cancer by increasing the patient ' s immunity to the tumour 
cells and in the production of polyclonal and monoclonal 
antibodies for use in diagnosis and therapeutic treatment 
of cancer . 

The invention will now be illustrated by 
reference to the Figures of the accompanying drawings which 

Fig.l is a graph of the O.D. at 450 nm of a series of 
octapeptide/SM3 reactions in accordance with Example 1. 

Fig • 2 shows the design of overlapping peptide -octamers . 
The starting sequence consisted of a twenty amino acid 
tandem repeat with four amino acids of adjacent repeats on 
the amino and carboxyl ends . Overlapping octamers are 
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indicated [ ] and [A] refers to the alanine used as the 

spacer on the pins. Note, octamers 1 and 21 are identical. 
The one letter code for the amino acids is used. 

Fig. 3 is a graph of the binding of the monoclonal antibody 
SK-3 to octamer peptides . Monoclonal antibody ( 50 yg/ml, 
100 #1 per pin) was reacted with octamers 1-21 (defined in 
Fig. 2) and the binding visualized using an EL ISA assay. 
The boxed sequences in the insert represent the epitope 
recognized by SM-3 . 

Fig. 4 is a series of graphs showing the binding of 
monoclonal antibodies HMFG-2, LICR-LON-M8 and onc-M15 to 
octamer peptides. Monoclonal antibody { 50 /jg/ml, 100 //I per 
pin for HMFG-2 and onc-H15 ; 1:500 dilution of LICR-LON-M8 
ascites fluid) were reacted with the octamers 1-21 (defined 
in Fig. 2 ) and binding visualized using an ELISA. The 
sequence in brackets indicates the epitope . 

Fig. 5 is a diagrammatic representation of epitopes 
recognised by monoclonal antibodies reactive with the PEN 
mucin core protein. Potential glycosylation sites are 
marked with arrows . 

Fig. 6 is a series of graphs showing the binding, of HMFG-2 
to lymphoblastoid and breast carcinoma" cell lines detected 
by fluorescence activated cell sorter . Cell's in suspension 
were incubated with HMFG-2 (-) or TAL-14 . ( • ) an anti 
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HLA-DR monoclonal antibody, and the binding detected using 
a fluorescence labelled rabbit anti-mouse . A, 
lymphoblastoid cell line PGF which expresses HLA-DR2 , the 
g chain contains the sequence Asp-Thr-Arg-Pro ( DTRP ) ; B, 
lymphoblastoid cell line Mann which expresses HLA-DR7 , the 
g chain contains the sequence Asp-Thr-Gln-Pro (DTQP) ; C, 
breast carcinoma cell line T47D which expresses the PEM 
mucin. 

Fig. 7 is a graph of the binding of anti-peptide antiserum 
to octamer peptides . The anti-serum diluted 1:200 was 
reacted with the peptides as described in Fig. 2. The boxed 
sequence in the insert represents the dominant epitope 
recognized by this anti-serum. 

Fig. 8 is a pair micrographs showing staining of breast 
cancer cell lines with antipeptide antiserum. 
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The invention will further be illustrated by the 
following Examples which are not intended to limit the 
invention in any way. 

EXAMPLE 1 

Determination of the epitope recognised by SM-3 

We have previously shown that the monoclonal 
antibody SM-3 recognises an epitope within the 20 amino 
acid tandem repeat found in the PEM core protein ( Gendler 
et al. , J, Biol. Chem . , 263 , 12820-12823 (1988 } ) . Using an 
epitope scanning kit purchased from Cambridge Research 
Biochemical ( Cambridge, England) we have further defined 
the epitope recognised by SM-3 and have shown that the 
minimum epitope consists of 

Proline - Aspartic Acid - Threonine - Arginine - Proline 

The method consists of synthesizing small , 
overlapping peptides on the end of the polypropylene pins, 
96 pins being held in a plastic support in the format and 
spacing of a microtitre plate . The tips of the pins have 
been chemically derivatized to produce functional groups to 
which amino acids may be coupled and the peptides are 
synthesised on to the tips of the pins by the stepwise 
elongation of the peptide chain with the addition of one 
amino acid per day. 

Once the desired length of peptide has been 
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synthesised, the pins can be assayed for their reactivity 
with antibody in a standard ELISA type assay, dispensing 
the reagents into 96 well microtitre plates in which the 
pins are then placed. 

We chose to synthesise overlapping octamers 
moving along the tandem repeat one amino acid with each 
octamer synthesized, and covering all possible octamers so 
that the first and last peptides are identical . Thus to 
design the octamers we used one tandem repeat and 8 amino 
acids of the next repeat i.e, VTSAPDRTPAPGSTAPPAHGVTSAPDTR . 
Thus the first octamer was VTSAPDTK , the second TSAPDTRP , 
the third SAPDTRPA etc. Octamers 2 to 5 are therefore 
"PDTRP" epitopes as hereinbefore defined . Octamers 1 and 6 
to 21 , each of which contains only a part or none of the 
PDTRP sequence and which have little or no reaction with 
SM-3 are not PDTRP epitopes . The results obtained with 
such a set of octamers are shown in Fig . 1. indicating that 
the epitope for SM-3 is PDTRP . 
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EXAMPLE 2 

Many of the monoclonal antibodies which show epithelial 
specificity and recognise tumour associated antigens react with 
epitopes found on large molecular weight mucin molecules. Until 
recently, most of the biochemical characterization of mucins was 
performed using the large mucins produced, for example, in the 
gastrointestinal tract, the lung and the cervix (Hounsell and Feizi, 
1982; Marshall and Allen, 1978; Slayter et al. , 1984; Carls ted t et 
al. ,1983). However, other glands including the mammary , sweat and 
salivary glands produce mucins and one, which we have termed 
polymorphic epithelial mucin (PEM) , appears to be the target antigen 
for many monoclonal antibodies recognising epithelial or carcinoma 
associated determinants (Burchell and Taylor-Papadimitriou, 1989). 

The PEM component is of particular interest because, although it 
is produced by several normal epithelial tissues, and abundantly by 
the lactating mammary gland , it is also expressed by breast and other 
carcinomas (Taylor-Papadimi triou et al. , 1986; Girling et al. , 1989) . 
Moreover , the mucin appears to be glycosylated differently by cancer 
cells, with the result that novel carbohydrate epitopes appear 
(Kjeldsen et al. , 1988) , and core protein epitopes which are masked in 
the normal mucin are exposed on the cancer-associated component. One 
such core protein epitope is detected by the antibody SM-3, which was 
developed using the chemically deglycosylated normal PEM mucin 
(Burchell et al. , 1987). This antibody reacts with more than 90% of 
breast cancers and with the deglycosylated mucin, but shows little or 
no reaction with the normal breast or the native PEM mucin purified 
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from milk (Burchell et al., 1987; Girling et al. , 1989). The 
specificity of the SM-3 antibody makes it a potentially useful tool in 
the diagnosis and treatment of breast cancer, and it was therefore of 
great interest to identify the SM-3 epitope. 

Recently the amino acid sequence of the immunogenic domain of the 
PEM mucin (Gendler et al., 1988), was deduced from the nucleotide 
sequence of cDNA clones coding for this region (Gendler et al., 1987). 
The domain is made up of repeat units, 20 amino acids in length, and 
the number of repeats varies from individual to individual, giving 
rise to the polymorphism seen at the DNA and protein level (Swallow et 
al. , 1987) . The reading frame of the tandem repeat was definitively 
established by showing binding of three core protein reactive 
antibodies (HMFG-1, HMFG-2 and SM-3) to a synthetic peptide with the 
corresponding sequence (Gendler et al. , 1988) . The sequence is in 
agreement with what might be expected for a mucin and contains serines 
and threonines (possible O-glycosylation sites) separated by proline 
rich stretches, suggesting a linear structure. Using overlapping 
oc tamers covering the sequence of the twenty amino acid tandem repeat 
we have now been able to identify a continuous amino acid sequence, 
. Pro~Asp-Thr-Arg-Pro as the core of the SM-3 epitope. Interestingly 
other core protein epitopes which are found on both the normal and 
cancer associated mucin, and are recognized by other antibodies, also 
map to this region, but none contain the proline at the amino end of 
the SM-3 epitope. The data are consistent with the idea that the 
exposure of the SM-3 epitope on the cancer associated mucin can be 
attributed to aberrant processing resulting in shorter carbohydrate 
side chains. Preliminary studies indicate that it is possible to use 
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a peptide corresponding to part of the tandem repeat sequence to 
develop antibodies to the PEM mucin which are reactive with breast 
cancers. 

Materials and Methods 

Peptides: The peptides were synthesized by solid phase techniques on 
a 430A peptide synthesizer (Applied Biosystems), using commercially 
available t-boc protected amino acids and resins as previously 
described (Tovnsend et al. , 1986) . 

Synthesis of overlapping peptide oc tamers: Overlapping peptide 
oc tamers were synthesized onto the end of polypropylene pins using an 
Epitope Scanning Kit (Cambridge Research Biochemicals, Cambridge, 
U.K.) which is based on a method originally published by Geysen et al, 
(1984). The pins are supplied with an alanine attached to the ends 
which acts as a spacer, thus the amino acid at the carboxyl end of the 
synthesized peptide is joined by a peptide bond to alanine. 

Antibodies: The development and characterization of the monoclonal 
antibodies HKFG-1, HMFG-2 and SK-3 (Taylor-Papadimi tr iou et al. , 1981? 
Burchell et al., 1983, 1987; Girling et al. r 1989), LICR~L0N~M8 
(Foster et al., 1982), onc-KIS, onc-M23, onc-M27 (Linsley et al. , 
1988) has been described elsewhere. The monoclonal antibody TAL 14.1 
is directed to HLA-DR. 

Anti-peptide mouse antiserum was produced by immunizing 
intraperitoneally with 25 yg of PDTR peptide 1^24 (see Table 2) 
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coupled to thyroglobulin followed by three injections of 25 yg 
PDTR 1-14 conjugated to bovine serum albumin. 

ELISA: Monoclonal antibodies (50 pg/ml or 1:500 dilution of ascites 
fluid, 100 yl per pin) and polyclonal serum (1:200) were tested for 
their reactivity with the oc tamers in an ELISA assay as previously 
described (Burchell et al. f 1983), using peroxidase conjugated rabbit 
antimouse antibody (DAK0, Denmark) . The octamers were synthesized 
onto pins conferring to the format of a 96 well microtitre plate and 
so for the assay pins were inserted into microtitre dishes containing 
the antibodies or reagents. 

Binding of HHFG-2 to cell lines: Cell pellets containing 10 cells 
were washed with RPMI containing 10% foetal calf serum and suspended 
in 1 ml of hybridoma tissue culture supernatant containing 0.02% 
sodium azide and incubated at 4°C for 1 hour. After two washes with 
RPMI containing 10% FCS, 0.02% azide (wash buffer) the cell pellets 
were suspended in 1 ml of fluorescence conjugated rabbit anti mouse 
(DAK0, Denmark) which had been diluted 1:20 in wash buffer and 
filtered prior to use. After 30 minutes at 4°C the pellets were 
washed once with wash buffer (20 ml) and once with PBSA (20 ml) and 
then suspended in 1 ml of PBSA. Binding of fluorescence conjugated 
rabbit anti -mouse was then analyzed in a Spectra-Physics fluorescence 
cell sorter. 
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Results 

Identification ©f the epitope recognised by aBtihody SM-3 

The sequence of the tandem repeat unit of the PEM mucin is shown 
in Figure Z Arbitrarily the sequence has been written as starting 
vith PDTR since this vas the beginning of the first tandem repeat in 
one of the cDNA clones (Gendler et al. , 1987). The antibody SM-3 has 
been shown previously to react vith the PDTR peptide (1-24) which 
corresponds to one tandem repeat and four amino acids of the next 
repeat (see Table 1). To further define the epitope recognized by the 
SM-3 antibody, a series of overlapping peptide oc tamers, covering a 
single repeat and eight amino acids of adjacent repeats (Fig. 2) vere 
synthesized onto the tips of polypropylene pins (see Materials and 
Methods) . Figure 3 shows the results obtained when, the .peptides were 
reacted with SM-3 in an ELISA. It can be seen that all the peptides 
reactive with the SM-3 antibody contain the sequence 
Pro-Asp-Thr-Arg-Pro which appears therefore to represent the minimum 
epitope. 

» 

Immunogenic! ty of the domain containing the SH-3 epitope 

Monoclonal antibodies reactive with the PEM mucin have been 
developed using a variety of immunogens , including human milk fat 
globule extracts, which contain the mucin, (Taylor-Papadimitriou et 
al. , 1981; Hilkens et al. , 1984; Foster et al. , 1982; Ceriani et al. , 
1983) and carcinoma cells (Bramwell et al. , 1983; Price et al. , 1985) . 
Some were obtained from mice injected with both the mucin and cultured 
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Table 1 

Binding of antibodies to peptides based on the sequence of the PEM mucin 

of tandem repeat 

Antibody Imraunogen Reaction vith Ref 

PDTR peptide 1 PDTR peptide 2 VTSA peptide 3 
(a. a. 1-24) (1-14) (17-20, 1-10) 



HMFG-2 
SM-3 



LICR- 
L0N-M8 



HMFG , milk + 
epithelial cells 

Deglycosylated + 
mucin 

HMFG + 



onc-M15 MCF-7, HHFG 



onc-K23 MCF-7, HMFG 
onc-M27 W5-6, HMFG 



Burchell et 
al., 1983 

Burchell et 
al., 1987 

Foster et 
al., 1982 



Linsley et 
al., 1987 



^PDTR peptide (1-24) PDTRPAPGSTAPPAHGVTSAPDTR 
2 PDTR peptide (1-14) PDTRP APG ST AP P AC* 

3 VTSA peptide (17-20, 1-10) VTSAPDTRPAPGST 
4 HMFG Human Milk Fat Globule 
Cysteine included to allow for coupling 
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cells (Taylor-Papadimitriou et al. , 1981? Burchell et al. , 1983; see 
Table 1). Ve have tested some of these antibodies for their reaction 
vith the peptide coding for the tandem repeat, and found that many 
show a strong positive reaction vith the FDTR peptide (amino acids 
1-24, Table 1). These include the antibodies listed in Table 1 as 
veil as antibodies NCRC11 (Price et al. , 1985) and DF3 (Sekine et al. , 
1985). To identify the domain reactive vith some of these antibodies 
in more detail, their reaction vith smaller peptides vas examined. As 
shovn in Table 1 all of the antibodies tested reacted vith tvo smaller 
peptides, both of vhich contained the first 10 amino acids of the PDTR 
peptide. This region also contains the SM-3 epitope, suggesting that 
this part of the tandem repeat contains epitopes vhich are 
immunodominant in the mouse. The specific epitopes reactive vith 
three of the antibodies shovn in Table 1, namely EMFG-2 , L1CR-L0N-M8 
and onc-15 vere identified in more detail using the overlapping 
octamers illustrated in Figure 2 and the results are shovn in Fig. 4. 
Antibodies HMFG-2 and L1CR-L0N-M8 shov a remarkably similar pattern of 
reaction vith the octamers and in both cases, the minimum epitope 
appears to be Asp-Thr-Arg (DTR) , vhile that recognised by onc-15 is 
Thr-Arg-Pro-Ala (TRPA) . 

The results of the detailed epitope mapping indicate that the 
single threonine and flanking amino acids betveen potential 
glycosylation sites constitute a highly immunogenic domain of the PEM 
tandem repeat. The overlap betveen the different epitopes is shovn in 
Figure 5. 
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Role of adjacent amino acids 

When the PIR protein data base was searched for the epitopes shown 
in Figure, 5 a large number of proteins ranging from bacterial proteins 
to HLA-DR £ chain were shown to contain the sequence Asp-Thr-Arg. 
However the antibodies HMFG-2 and MB have been reported to show a 
restricted reaction with the polymorphic epithelial mucin. To 
investigate whether one of these antibodies would bind to proteins 
containing Asp-Thr-Arg with different flanking amino acids, we studied 
the binding of HMFG-2 to a native protein known to contain the minimum 
epitope. The (3 chain of HLA DR2 contains the sequence Asp-Thr-Arg-Pro 
while the g chain of HLA DP 7 contains the sequence Asp-Thr-Gln-Pro 
(Young et al., 1587), however the adjacent amino acids show no 
homology with the PEM mucin. The binding of HMFG-2 to lymphoblastoid 
cell lines expressing HLA DR2 (PGF cells) and HLA DR7 (MANN cells), 
was analyzed and compared with the binding of HMFG-2 to a breast 
carcinoma cell line expressing the PEM mucin (T47D). Figure 6 s hovs 
that as expected HMFG-2 did not bind to MANN cells expressing DR7 , nor 
did it bind to cells expressing DR2 which contains the sequence 
Asp-Thr-Arg. However, HMFG-2 did show strong binding to the breast 
carcinoma cell line T47D. Thus the minimum epitope for HMFG-2 
consists of only three amino acids in the context of a 
conformationally free peptide while in the native protein, which has 
conformational restraints, the adjacent amino acids have an important 
role in determining the antigenic site. 
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Peptides as immunogens for the development of breast cancer reactive 
antibodies 

Vith the exception of SK-3, the monoclonal antibodies discussed 
above were developed using glycosylated mucin either isolated from 
milk or associated vith cells , and in these cases the carbohydrate 
side chains would play a major role in determining which of the core 
protein epitopes were exposed. Ve have now examined the response of a 
mouse injected with the PDTR-peptide (1-24) and the PDTR peptide 
(1-14) coupled to a carrier protein (see Materials and Methods) , by 
following the reaction of the antiserum with the overlapping oc tamers. 
Figure 7 shows that although the overall binding was higher than with 
the monoclonal antibodies, there appeared to be a peak of binding at 
peptides 4 and 5, This corresponds to antibodies being present in the 
serum that recognize epitopes in the sequence P-D-T-R-P-A-P • The 
antiserum showed a positive reaction with primary breast carcinomas 
and cell lines (see Figure 8), indicating that some of the antibodies 
present reacted with epitopes exposed on the cancer associated mucin. 
This initial experiment suggests that peptides based on the tandem 
repeat sequence can be used as immunogens to develop antibodies 
reactive with the cancer associated PEM mucin. 

Discussion 

The core protein of the polymorphic epithelial mucin is made up of 
a number of tandem repeats, each of which contains five potential 
0-linked glycosylation sites (Gendler et al. , 1988) . These tandem 
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repeats contain the epitopes recognized by many of the monoclonal 
antibodies directed to this mucin. Using overlapping octamers the 
epitope recognized by SM-3, has been defined and has been shown to 
include the single threonine at position 3. Furthermore, epitopes 
reactive with other monoclonal antibodies have also been shown to map 
to this region. It therefore appears that, at least in mice, this 
sequence of the tandem repeat is highly immunogenic. 

Although the epitopes recognised by HMFG-2 , LICR-L0N-M8 , one 15 
and SM-3 are overlapping, the reactivity of these antibodies with the 
mucin is quite different. All react with the mucin expressed by 
breast carcinoma cells but the HMFG-2, one 15 and M8 determinants are 
also expressed on the normal mucin (Burchell et al. , 1983; Linsley et 
al., 1988 ; ' Foster et al., 1982), whereas the SM~3 determinant is not 
(Burchell et al. , 1987; Girling et al. , 1989). The fact that the SM-3 
monoclonal antibody was raised to the chemically deglycosylated normal 
milk mucin suggest that carbohydrate may play a role in masking this 
epitope on the PEM produced by normal cells. The combined epitopes 
recognized by the three antibodies which react with the normal mucin 

cover the sequence, DTRPA , which is centrally placed between the 

18' 1Q f q in 
flanking glycosylation sites <T , S , S , S in figure 2. t see 

also figure 5 ; . However, the SM-3 epitope PDTRP contains a proline at 

the amino end which is only one amino acid away from a glycosylation 

site (serine 19 ) . This suggests that in the normal mucin the 

prol ine on the amino end of the SM— 3 epi tope is masked by carbohydra t e 

while in tumours, because of premature termination of the sugar chains 

19' 18' 
on serine and threonine it is exposed . Recent results lend some 

support to this idea in that the normal mucin isolated from human milk 
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has been shown to contain extended and/or branched 

poly-N-acetyl~lactosamine side chains (Hanisch et al. , 1989) while the 
oligosaccharide side chains of the mucin produced by the breast 
carcinoma cell line, BT20 consists of only 3 or 4 sugars (Hull et al. , 
1988) . 

Since all four of the epitopes mapped contain at their core a 
threonine, this amino acid in the reactive epitopes cannot be 
glycosylated. However it should be noted that fewer HMFG-2 specific 
epitopes are present on the normal mucin found in milk than on the 
cancer associated mucin (Burchell et al* , 1983) . This could mean that 
some of the threonines in the DTR epitope are in fact glycosylated on 
the normal mucin. An alternative explanation is that due to the 
heterogeneity in the length of the carbohydrate side chains, some of 
the DTR epitopes on the normal mucin are masked by long or branched 
polylactosamine chains , while others are not. Another feature of the 
HMFG-2 epitope is that the affinity for the antibody appears to vary, 
depending on the micro environment. Upon comparing the binding of the 
antibody to the normal mucin and to a cancer associated mucin, it is 
found that the antibody shows a higher affinity for the normal mucin, 
suggesting that the normal oligosaccharide side chains are required 
for maximum affinity. That oligosaccharide side chains may be 
important is also shown by the fact that both HMFG-2 and SM-3 bind 
more avidly to the partially deglycosylated mucin than to the mucin 
stripped of all carbohydrate (Burchell et al. , 1987) . 

Although the oligosaccharide side chains may play a role in 
determining the conformation of the core protein epitopes recognized 
by antibodies HMFG-2 and SM-3, it is quite clear that there are 
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antigenic determinants in the core protein of the PEM mucin which are 
immunogenic and are expressed by breast cancers. Using a synthetic 
peptide corresponding to the tandem repeat sequence coupled to a 
protein carrier ve have been able to induce antibodies in the mouse 
which show reactivity with breast carcinomas. Epitope mapping with 
the antiserum indicate that dominant epitopes are found in the 
sequence PDTKPAP between the glycosylation sites at Thr 18 \ Ser 19 ' and 
Ser 9, Thr 10 (Figure 2 ), again confirming the immunogenic nature of 
this domain. These results suggest that it should now be possible to 
take a directed approach to the production of monoclonal antibodies 
with SM-3-like specificity which react with breast cancers and such 
work is now in progress. Furthermore, the identification of an 
epitope which is selectively exposed in carcinoma cells gives a focus 
for studying the aberrant processing of the mucin by tumour cells and 
suggests that it may be possible to use peptides or polypeptides 
containing the tandem repeat as synthetic vaccines cor some human 
cancers. 
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Example 3 

Expression of a partial cDNA clone coding for several tandem repeats 
as a fusion protein in E. coll 

Clone 7 which was isolated by Gendler et al. (PNAS 84, 6060-6064, 
1987) and is made up of approximately 10 tandem repeat units, was cut 
out of the Bluescript vector SK with EcoRl and inserted into the 
expression vector pRX-1 (Rimm and Pollard, Gene 75, 323-327, 1989). 
Selection for expression of the mucin fragment in E. coli DH-1 strain 
vas done by probing lifts of the transf ormants with antibody. DNA 
from plasmid expressing the mucin fragment was prepared and E. coli 
strain CAG-456 (see Rimm and Pollard) vas transformed for higher 
expression of the mucin fragment. The fragment containing 18 a. a. of 
the trpE protein could be detected with antibody on Western blots as a 
fragment of approximately 30 Kd molecular weight. Coomassie blue 
staining of the gel-separated bacterial lysate allowed visualisation 
of this band which was increased in cells induced with indolyl acrylic 
acid, and appeared to make up 10-20X of the bacterial protein. 
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Example .4 

Expression of a partial cDNA clone coding for several tandem repeats 
in a mammalian cell 

Clone 7 was excised with EcoRl from the Blues crip t vector 
(Stratagene, California) and inserted into the EcoRi site of a vector 
based on the pJ3 expression vector so that the correct reading frame 
of the tandem repeat was maintained* Clones vith plasmids containing 
inserts in the correct orientation vere selected using 
oligonucleotides which overlapped the vector insert sequence. DM was 
prepared and purified according to standard methods and transfected 
into Cos cells. After 48 hours cells were dissolved in sample buffer 
for acrylamide gel electrophoresis and gels were blotted on to 
nitrocellulose paper. Probing of the blot with antibody using an 
ELISA to detect bound antibody showed a band around 30 Kd in the 
transfected cells. 

The procedures of Example 3 & 4 may be used to express and 
test PDTRP epitope polypeptides or fusion proteins according 
to the inventon . 
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CLAIMS 



1. 



A polypeptide of formula (I) 



{X} §c {(A) m PDTRP(A) n } y {X} z 



(I) 



wherein x and z are the same or different and each is 0 or 

an integer of one or more; 

y is an integer of one or more 

each X, where present, is an independently selected amino 
acid residue; 

m and n are the same or different and each is 0 or one and 
A,P,D,T and R are the residues of the amino acids, alanine, 
proline, aspartic acid, threonine and arginine 
respectively; or containing a sequence immunologically 
equivalent to the sequence 



as defined above, optionally bearing one or more saccharide 
moieties, or a pharmaceutical^ acceptable salt thereof, 
other than beta-glycuronidase . 



(A) PDTRP(A) 

Jit XI 



2. 



A polypeptide selected from 



(a) 



PDTRP 



APDTRP 
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. So 

PDTRPA 
APDTRPA 

(b) polypeptides (a) above bearing one or more 
saccharide moieties 

and ( c ) pharmaceutical^ acceptable salts of 

polypeptides (a) and (b) above. 

3. A polypeptide according to claim 1 or claim 
2 linked to a carrier protein, 

4 . A polypeptide according to claim 3 
wherein the carrier protein is selected from keyhole limpet 
haemocyanin, albumen and thyroglobulin . 

5 . A nucleic acid fragment encoding a 
polypeptide according to any one of claims 1 to 4 . 

6. A fragment according to claim 5 containing a 
continuous coding sequence of formula (II) 

{XXX} x { (GCX) m CCX GAPy ACX RRR CCX( GCX ) n } y {XXX} z (II) 

wherein RRR is either CGX or AGPu 

{X} x {(A) B PDTRE(A) B } y [X} z 

x and 2 are the same or different and each is 0 or an 

integer of one or more ; 

y is an integer of one or more 

each X, where present, is an independently selected amino 



WO 90/05142 



PCT/GB89/01340 



Si 

acid residue; 

m and n are the same or different and each is 0 or one and 
A,P,D,T and R are the internationally recognised one-letter 
symbols for the residues of the amino acids , alanine , 
proline, aspartic acid, threonine and arginine 
respectively; 

X is any deoxyribonucleotide 

XXX is a codon encoding an amino acid residue ; 
Py is C or T; 
Pu is A or G; 

and A, C, G and T are the internationally recognised 

one-letter codes for the four deoxyribonucleotide bases . 

7. A fragment according to claim 6 comprising 
the coding sequence 

(GCC) CCG GAC ACC AGG CCG (GCC). 

8 . A DNA construct comprising an operational 
gene , encoding a fusion protein , containing a fragment 
according to any one of claims 5 to 7 in expressible form, 

9 . A cloning or expressing vector containing a 
nucleic acid fragment or construct according to any one of 
claims 5 to 8 . 

10* An expression vector consisting .of 
poliovirus genomic DNA containing a coding sequence in 
expressible form encoding a fragment or construct according 
to any one of claims 5 to 8. 

11 . A vector according to claim 10 wherein the 



WO 90/05142 



PCT/GB89/01340 



fragment is contained within the poliovirus capsid protein 
gene • 

12. Host cells transformed with a cloning or 
expression vector according to any one of claims 9 to 11. 

13. A nucleic acid probe comprising a fragment 

according to any one of claims 5 to 7 and a detectable 
label. 

14. An antibody, other than SM-3 and HMFG-2, 
capable of recognising a polypeptide according to any one 
of claims 1 to 4 . 

15 . An antibody according to claim 14 linked to 
a therapeutically or diagnostically effective moiety. 

16 . A pharmaceutical formulation comprising a 
carrier or diluent or adjuvant and a polypeptide according 
to claim 14 or claim 15 . 

17. A cell , other then HSM-3 , capable of 
secreting antibodies against a polypeptide according to any 
one of claims 1 to 4. 

18 . A immortal cell according to claim 17 . 

19. A polypeptide according to any one of claims 
1 to 4, a nucleic acid fragment according to any one of 
claims 5 to 7 , a construct according to claim 8 , a vector 
according to any one of claims 9 to 11, a cell according to 
claim 12, a probe according to claim 13, an antibody 
according to claim 14 or claim 15 , a pharmaceutical 
formulation according to claim 16 or a cell according to 
claim 17 or claim 18 for use in a method for treatment of 
the human or animal body by surgery or therapy or a 



WO 90/05142 



53 



PCIYGB89/Q1340 



diagnostic method practiced on the human or animal body . 

20 . Use of a polypeptide according to any one of 
claims 1 to 4 , a nucleic acid fragment according to any one 
of claims 5 to 7, a construct according to claim 8, a 
vector according to any one of claims 9 to 11, a cell 
according to claim 12 , a probe according to claim 13 , an 
antibody according to claim 14 or claim 15 , a 
pharmaceutical formulation according to claim 16 or a cell 
according to claim 17 or claim 18 in preparation of a 
medicament for use in a method for treatment of the human 
or animal body by surgery or therapy or a diagnostic method 
practiced on the human or animal body. 

21_. A method of treatment or diagnosis 
comprising administering an effective , non-toxic amount of 
a polypeptide according to any one of claims 1 to 4, a 
nucleic acid" fragment according to any one of claims 5 to 
7, a construct according to claim 8 , a vector according to 
any one of claims 9 to 11, a cell according to claim 12 , a 
probe according to claim 13 , an antibody according to claim 
,14 or claim 15 , a pharmaceutical formulation according to 
claim 16 or a cell according to claim 17 or claim 18 to a 
human or animal in need thereof . 

22 • A diagnostic test process comprising 
contacting a sample suspected to contain abnormal human 
mucin glycoprotein with an antibody according to claim 14 
or claim 15 . 

23 • A diagnostic test kit comprising an antibody 
according to claim 14 or claim 15 . 
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